Machine Learning Models for Housing Prices Forecasting using Registration Data

نویسندگان

Mehdi Farahzadi Islamic Azad University

Mohammad Hassan Behzadi Islamic Azad University

Rahman Farnoosh Iran University of Science and Technoligy

چکیده مقاله:

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient Boosting Regression Algorithm (XGBR), and the Long Short-Term Memory Neural Network Algorithm (LSTM). This research has been done using the data of the Statistics Center of Iran, which contains information on the purchase and sale of residential units in Tehran in the years 2014 to 2020 and includes 998299 transactions and 11 features. Loss of data, batch data conversion, normalization, etc. are performed on the housing data set to obtain the final and error-free data set. To divide the data set into training and test data sets, the important and practical method of cross-validation or K-Fold has been used because of its simplicity and effectiveness and as a universally valid method. Various evaluation criteria such as MSE, RMSE, MAE,ME and R2 were used to compare the models and identify the best model. Comparison of models in terms of all evaluation criteria in all K-fold subsets proves the stability and superiority of the Extreme Gradient Boosting Regression model.

Download for Free

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Hierarchical Bayesian Models for Large Space Time Data of the Housing Prices in Tehran

Housing price data is correlated to their location in different neighborhoods and their correlation is type of spatial (location). The price of housing is varius in different months, so they also have a time correlation. Spatio-temporal models are used to analyze this type of the data. An important purpose of reviewing this type of the data is to fit a suitable model for the spatial-temporal an...

متن کامل

Forecasting Housing Prices under Different Submarket Assumptions

This research evaluated forecasting accuracy of hedonic price models based on a number of different submarket assumptions. Using home sale data for the City of Knoxville and vicinities merged with geographic information, we found that forecasting housing prices with submarkets defined using expert knowledge and by school district and combining information conveyed in different modeling strategi...

متن کامل

Machine Learning for Better Models for Predicting Bond Prices

Bond prices are a reflection of extremely complex market interactions and policies, making prediction of future prices difficult. This task becomes even more challenging due to the dearth of relevant information, and accuracy is not the only consideration–in trading situations, time is of the essence. Thus, machine learning in the context of bond price predictions should be both fast and accura...

متن کامل

Dust source mapping using satellite imagery and machine learning models

Predicting dust sources area and determining the affecting factors is necessary in order to prioritize management and practice deal with desertification due to wind erosion in arid areas. Therefore, this study aimed to evaluate the application of three machine learning models (including generalized linear model, artificial neural network, random forest) to predict the vulnerability of dust cent...

متن کامل

Forecasting Agricultural Commodity Prices Using Multivariate Bayesian Machine Learning Regression by

Readers may verbatim copies of this document for non-commercial purposes by any means, provided that this copyright notice appears on all such copies. The purpose of this paper is to perform multiple predictions for agricultural commodity prices (one, two and three month periods ahead). In order to obtain multiple-time-ahead predictions, this paper applies the Multivariate Relevance Vector Mach...

متن کامل

a new approach to credibility premium for zero-inflated poisson models for panel data

هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...

15 صفحه اول

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

عنوان ژورنال

Journal of Statistical Research of Iran

دوره 17 شماره 1

صفحات 191- 214

تاریخ انتشار 2020-08

دنبال کردن

لغو دنبال کردن

{@ msg @}

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

کلمات کلیدی

میزبانی شده توسط پلتفرم ابری doprax.com